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T. E. HARRIS'S CONTRIBUTIONS TO RECURRENT MARKOV 
PROCESSES AND STOCHASTIC FLOWS 

By Peter Baxendale 
University of Southern California 

This is a brief survey of T. E. Harris's work on recurrent Markov 
processes and on stochastic flows, and of some more recent work in 
these fields. 

1. Introduction. I was a colleague of Ted Harris at USC, first as a sab- 
batical visitor for the academic year 1982-1983 and later as a regular faculty 
member from 1988 until his retirement. During this time, I spent many hours 
in discussions with Ted, and appreciated his insight into all areas of prob- 
ability theory. I feel deeply grateful for the opportunity to have learned so 
much from him. 

This paper covers two areas of Harris's work. Early in his career he wrote 
his seminal paper [16] on the existence and uniqueness of stationary measures 
for Markov processes satisfying a certain recurrence condition. Nowadays, 
this is called Harris recurrence, although Ted was far too modest a person to 
ever use the term himself. Section 2 contains a brief account of [16] together 
with some indication of later developments based on his idea. 

Later in his career Harris became interested in stochastic flows. These can 
be regarded as random mappings of an entire state space into itself. Section 3 
starts with some details of Harris's construction of a stochastic flow as a limit 
of random stirring processes. (Here, we see a transition from Poisson point 
processes and percolation theory into processes more frequently described by 
stochastic differential equations.) The rest of this section describes Harris's 
work on isotropic stochastic flows and coalescing stochastic flows, together 
with some more recent developments in these areas. 
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2. Harris recurrence. Consider a (time homogeneous) discrete time Markov 
process {X n :n > 0} taking values in a measurable space (S,B). For simplic- 
ity here, we will assume B is separable, although many results are valid in a 
more general setting. Let P(cc, •) denote the transition probability function, 
and let ¥ x denote the law of the process with initial condition Xq = x. A 
nonzero measure [i on (S, B) is a stationary measure for the Markov process 
if 

(i(A) = [ W>(x, A)fi(dx) for all AeB. 

Definition. For any set A £ B, define the hitting time Ta = inf{n > 
1 :X n £ ^4}. Let m be a cr-finite measure on (S,B). The Markov process is 
said to be m-recurrent if P x (Ta < oo) = 1 for all x £ S whenever m(A) > 0. 

Nowadays, we say the Markov process is Harris recurrent if it is m- 
recurrent for some nonzero m. 

Theorem 2.1 (Harris [16]). Assume the Markov process {X n : n > 0} is 
m-recurrent for some nonzero a -finite measure m. Then there is a stationary 
measure /j, for the Markov process. Moreover, /i is unique up to a constant 
multiplier, and m is absolutely continuous with respect to fx. 

In the case of a discrete state space S, this result was already known; see 
Derman [13] and Chung [7]. The usual definition of recurrence to a point 
a £ S corresponds to m-recurrence with m taken to be the Dirac measure 
S a at the point a, and then the measure 

oo 

^) = ^P a (I n GB,T {a} >«), BeB, 

n=l 

is stationary. In this setting, the times of visits of X n to the recurrent point 
a are renewal times for the Markov process. However, with a nondiscrete 
state space S it will typically be necessary to look for recurrence to larger 
sets A than just singleton sets, and then the locations of the visits within A 
are of interest. 

For fixed A^B with m{A) > 0, denote by the times of visits to A, 

so that T A 1] = T A and T A k+1) = inf{n > : X n G A} . The Markov process 
Z m = X ( m ) with values in A is called the process on A. The transition 

A 

probabilities for the process on A will be denoted P^(x,-). The following 
result, combining several lemmas in Harris [16], gives a characterization of 
the stationary measure showing how the discrete space result is extended. 
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Proposition 2.1. Assume the Markov process {X n : n > 0} ism-recurrent 
for some nonzero a -finite measure m, and that m(A) > 0. Suppose that lia 
is a stationary probability measure for the process on A and that m is abso- 
lutely continuous with respect to \la on A. Define ll on (S,B) by 



Then \i is a stationary measure for the Markov process {X n : n > 0} on S and 
m is absolutely continuous with respect to fx. Moreover, any other stationary 
measure is a multiple of \x. 

It remains to consider the existence of the stationary probability lia for 
the process on A. At this point, we see the conflicting requirements for the 
set A: typically it has to be larger than a singleton so that m{A) > 0, but 
it should be chosen small enough so that the process on A has some good 
behavior. 

Let p n (x, y) denote the absolutely continuous part of the n-step transition 
probability P n (x,-) with respect to m. The following technical lemma of 
Harris is based in the idea that m-recurrence implies that for each x, the 
set T(x) = {y G S :p n (x,y) = for all n > 1} must satisfy m(T(x)) = 0. 

Lemma 2.1. Assume the Markov process {X n :n > 0} is m -recurrent for 
some nonzero a -finite measure m. For any r G (0, 1), there exist A£ B with 
< m(A) < oo, a positive integer k and a positive constant s such that for 
all x £ A, 



This enables Harris to obtain a Doeblin-like condition on the transition 
operator R(x, •) = (Pa(x, •) + ••• + P\{ x ^ an d the existence of the sta- 
tionary probability [xa follows directly. 

2.1. Small sets. 

Definition. A set A G B is a small set if there exist a positive integer 
k, a probability measure v and a constant f3 > such that 



(2.1) 




m{y G A:p 1 {x^y) + • • • +p k (x,y) > s} > rm(A). 



(2.2) 



p fe (x,-) >M-) 



for all x € A. 



The following result is a strengthening of Harris's Lemma 2.1. 
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Proposition 2.2 (Orey [40]). Assume the Markov process {X n :n > 0} 
is m-recurrent for some nonzero a-finite measure m. Every set E G B such 
that m{E) > contains a set A^B such that < m(A) < oo and 

m.i{p k (x,y) :x,y G A} > 

for some positive integer k. 

Corollary 2.1. The Markov process {X n :n > 0} is m-recurrent for 
some nonzero a-finite measure m if and only if there exists a small set 
A£B such that F X (T A < oo) = 1 for all x G S. 

Proof. Any set A with the property described in Proposition 2.2 is 
a small set, with v(B) = m(AC] B)/m(A). Conversely, suppose A has the 
properties described in Corollary 2.1, with k, v and j3 as in (2.2). If v{B) > 0, 
then P x (X n+k G B) > f3u(B)P x {X n G A). Since P X (T A < oo) = 1 for all x G S, 
then P x (X n G A infinitely often) = 1 and so P x (X n G B infinitely often) = 1. 
It follows that {X n : n > 0} is ^-recurrent. □ 

This result shows that in some sense the study of Harris recurrence is 
equivalent to the study of small sets with almost surely finite hitting times. 
The property that A is small can be used in two distinct but related ways. 
For convenience assume here that A is a small set with k = 1. The general 
case can be handled by applying the methods described below to the /c-step 
process {X n k : n > 0}. 

2.1.1. Coupling. Two copies {X n : n > 0} and {X' n : n > 0} of the Markov 
process can be coupled so that ¥{X n+ \ = X' n+1 \X n , X' n ) > (3 whenever 
(X n ,X' n ) £ Ax A. This implies that two copies {Z m : m > 0} and {Z' m : m > 
0} of the process on A can be coupled so that P(Z m+ i = Z' m+1 \Z m , Z' m ) > (3. 
It follows easily that the process on A has a unique stationary probability 
HA, say, with 

wn(x,-)-HA\\<2(i-pr 

for all m > and x G A. Details of this coupling argument may be found in 
Lindvall [29]. 

2.1.2. The split chain. A set C is said to be an atom for the Markov chain 
{X n : n > 0} if P(x, •) = P(y, •) for all x, y G C. If C is an atom, then the times 
of visits to C are renewal times for the Markov chain. The excursions away 
from C will be independent and identically distributed, and many questions 
about the large time behavior of the Markov chain may be resolved using 
this fact. 
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A singleton set C = {a} is an clearly an atom. Nummelin [38] showed how 
to use a small set A to build a Markov chain with an atom. [More generally, 
Nummelin assumes a minorization condition ¥(x,C) > h{x)v{C) with some 
nontrivial nonnegative function h; here, we specialize to h(x) = f31 A ( x )-] 
Consider the split chain {(X n ,Y n ) :n > 0} with state space S x {0,1} and 
transition probabilities given by 

P{Y n = l\F*VFj l „ l } = 
P{X n+1 eA\T*vJFZ} = 



Here, = a{X r :0<r<n} and T% = a{Y r :0<r<n}. Thus, the split 
chain evolves as follows. Given X n , choose Y n so that P(Y n = 1) = /31c(X n ). 
If Y n = 1 then X n+ \ has distribution whereas if Y n = 0, then X n+ i has dis- 
tribution (P(X n , •) -pi c {X n )v)/(l-pi c (X n )). The split chain {{X n , Y n ) : n > 
0} is designed so that it has an atom 5 x {1}, and so that its first component 
{X n : n > 0} is a copy of the original Markov chain. 

Thus, renewal theory can be used to describe the large time and stationary 
behavior of the split chain {(X n , Y n ) : n > 0}, and hence of its first component 
{X n :n>0}. 

Much more information about small sets, and more generally about the 
ergodic theory of Harris recurrent Markov chains, may be found in the books 
of Revuz [42] , Nummelin [39] and Meyn and Tweedie [36] and the references 
therein. 

2.2. Positive recurrence and rates of convergence. A Harris recurrent 
Markov process {X n : n > 0} is to be positive Harris recurrent if the station- 
ary measure \i can be normalized to be a probability measure on (S,B). 

Suppose that A is a small set with < m(A) < oo, and recall that fi^ 
denotes the stationary probability measure for the process on A. The sta- 
tionary measure /U given by (2.1) has total mass 

ti(S)= [ W{T A )n A {dx). 

J A 

Thus, the issue of positive recurrence depends on estimates of the expected 
hitting times K x (Ta) for x G A. 

Proposition 2.3 (Tweedie [43]). Assume A is a small set. Suppose 
there exist a measurable function V > and constants c > 0, k such that 
PV(x)<V(x)-cforx(£AandPV(x)<kforx£A. Then sup xeA E x (T A ) < 
1 + k/c and so the Markov process is positive Harris recurrent. 



n c (x n ), 

P(X n ,A)-/31 c (X n )v(A) 
1 - pi c (X n ) 



if Y n = l, 
if Y n = 0. 
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The proof is based on the fact that the first inequality assumed for the 
Lyapunov-Foster function V implies that when Xq ^ A the process V(X n ) + 
cn stopped at time Ta is a supermartingale. With stronger assumptions on 
the function V, together with conditions to ensure aperiodicity, results may 
be obtained concerning the rate of convergence of P n (x, •) to the stationary 
probability measure it, say. Several such conditions are given by Meyn and 
Tweedie [32, 36]. The following result includes also stronger conditions on 
the small set A so as to ensure aperiodicity. 

Proposition 2.4 (Meyn and Tweedie [32]). Assume the set A^B sat- 
isfies P(x, •) > ftv{-) for all x G A, where (3 > and v{A) = 1. Assume also 
there exist a measurable function V > 1 and positive constants A < 1 and 
k such that PV(x) < XV (x) for x ^ A and PV{x) < k for x € A. Then the 
Markov process has a unique stationary probability measure tt, say. More- 
over, there exist positive constants 7 < 1 and M with the property that 



for all x £ S and n > whenever f : S — >■ K is a measurable function such 



More details about estimates of the form (2.3) can be found in Chapter 
16: ^-uniform ergodicity of [36]. In Meyn and Tweedie [37] it is shown that 
the constants M and 7 can be chosen depending only on A, k and j3. See 
also Baxendale [4]. 

2.3. Continuous time Markov processes. 

2.3.1. Sampled chains. Suppose that {Xt :t > 0} is a time homogeneous 
Markov process, and that {T(n) : n > 0} is an independent undelayed re- 
newal process with increment distribution a, for some probability distribu- 
tion on (0,oo). Then the sampled chain {Y n :n > 0} defined by Y n = Xx[ n ) 
is a time homogeneous Markov chain. If Pt(x,-) denotes the time t tran- 
sition probability function for {Xt :t > 0}, then {Y n :n > 0} has transition 
probability function K a (x, ■) = J Pt(x, -)a(dt). 

The A-skeleton chain Y n = X n /\ corresponds to the deterministic a = 5a- 
Alternatively, the resolvent chain observes the process X at the times of a 
rate 1 Poisson process and has transition probability 



which is the resolvent of the original continuous time process. The advan- 
tages and disadvantages of various choices for a, and the connections be- 
tween the theories of Harris recurrence for continuous time and discrete 



(2.3) 




that \f(y)\<V(y) for allyeS. 
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time Markov processes are discussed in the papers [33, 34] of Meyn and 
Tweedie. Results analogous to those in Propositions 2.3 and 2.4, involving 
the action of the infinitesimal generator L on V, are given in Meyn and 
Tweedie [35]. 

2.3.2. Stopping times. An alternative approach to recurrence for con- 
tinuous time Markov processes was developed independently by Maruyama 
and Tanaka [30] and Khas'minskii [21]. Assume that {Xt :t > 0} is a strong 
Markov process with right continuous paths and left limits on a separable 
metric space S. Suppose that D\ and D 2 ar e open sets with disjoint clo- 
sures D\ and D 2 with the property that the stopping times T_d 1 = inf{t > 
0:Ii€Di} and T D2 = inf{t >0:X t £D 2 } are both P x -almost surely finite 
for all x G S. Define inductively sequences a n and r n of stopping times by 
(To = inf{i > 0:X t G D{\, r n = inf{t > a n : X t G D 2 } for n > 0, and cr n = 
inf{t > r n _i : G Z?i} for n > 1. Then {Y n = X an : n > 0} is a time homo- 
geneous Markov chain on D\. 

Assume for the moment that the process {Y n : n > 0} on Z?i has a sta- 
tionary probability measure fJ-jj 1 - Then it is shown in [30] and [21] that the 
measure \x on (S,B) defined by 



is a stationary measure for the original continuous time Markov process {Xt : 



So far this is a very natural extension to continuous time of Harris's 
result in Proposition 2.1. It remains to show the existence of the stationary 
probability measure (ip , and this is where both [30] and [21] impose extra 
conditions on the process {Xt :t > 0}. In particular, they both use additional 
properties of the hitting distribution P x (Xt Di G B) for x G D2 and B C D\ 
which ensure that the process {Y n : n > 0} satisfies Doeblin's condition (D). 

The representation (2.4) can be used to convert occupation time esti- 
mates for the process {Xt : t > 0} in a very direct way into estimates on 
the stationary measure fi, see, for example, Baxendale and Stroock [6] and 
Baxendale [3]. 

3. Stirring processes and stochastic flows. 

3.1. Random stirring in M. d . Let </> : R d — > M. d be a homeomorphism of R d 
onto itself such that <f)(x) = x whenever ||x|| > K, for some K. The mapping 
can be thought of as a stirring of M. d centered at the origin G M d . (Harris 
used the term "stirring" originally in the case where the homeomorphism 
<j) is volume preserving, but it is convenient to keep the term in this more 



(2.4) 




t>0}. 
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general setting.) For a € M d , the translated mapping (f> a (x) = a + <fi(x — a) 
represents stirring centered at a. 

Consider a Poisson point process on M. d x (0, oo) with intensity Xdxdt. 
For each atom (a,t) of the point process, apply the mapping <p a at time t. 
Then for < s < t < oo the value X s t of the stirring process is the random 
mapping of M. d to itself obtained as the composition of the stirrings (f) a at 
times u for all the atoms (a, u) with s < u < t. A percolation argument, using 
the fact that 4>(x) — x has bounded support, can be used to show that for 
sufficiently small t — s the restriction of the mapping X s t to any bounded 
set is almost surely given by the composition of a finite number of stirrings. 
It follows that the process {X s t : < s < t < oo is well defined. 

The essence of this construction can be seen in the paper by Harris [17] 
on the construction of an exclusion process with nearest neighbor rates. In 
[17], he considers a point process on the set of bonds of the integer lattice 
7L d and the corresponding stirring switches the two ends of the bond. The 
construction in [17] allows the rates to depend on the local configuration, but 
in the simplest case of constant rate it fits into the setting above. Random 
stirring on the real line is studied in the paper [28] by Harris's student W. 



3.1.1. Convergence to a stochastic flow. Consider the effect of letting the 
magnitude of the displacement involved in each stirring <f> tend to zero while 
letting the intensity of the Poisson process tend to infinity. More precisely, 
fix a compactly supported vector field V on M. d and let 4> n denote the time 
l/\/n flow along V. Let {X™ t : < s <t < oo} be the random stirring process 
obtained using the stirring function 4> n together with a Poisson process with 
rate nXdxdt. Assume the centering condition 



Then under appropriate smoothness conditions on the vector field V the 
processes {X™ t : < s < t < oo} converge weakly to a process {X st : < s < 
t < oo} with values in the group of homeomorphisms of M. d . This is proved 
in Harris [18] for the case d = 2 when V is divergence free and rotationally 
symmetric. A more general form of result (although on a compact manifold) 
is given in Matsumoto and Shigekawa [31]. 

The process {X st : < s < t < oo} has the properties: 

(i) for each x S M. d and s > the mapping t — > X s t(x) is continuous; 

(ii) Xt u o X s t = X su whenever s <t <u; 

(iii) if si < t\ < S2 < t2 < • • • < s n < t n the mappings X Si t, , 1 < i < n are 
independent; 

(iv) the distribution of X st depends only ont-s. 



C. Lee. 
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Any process with these properties will be called a (time-homogeneous) stochas- 
tic flow on M d . Much information about stochastic flows may be found in 
the books of Kunita [22] and Arnold [1]. 

The law of a stochastic flow is determined by the laws of its fc-point 
motions 

t ->• (Xot (xi ) , Xot (X 2 ),..., Xot (x k ) ) G R* 

for all A; > 1 and x±,X2, ■ ■ ■ ,Xf- S K d . It is easy to see that each /c-point 
motion is a Markov process on M. dk . Under suitable regularity conditions, 
the fc-point motions are diffusion processes and the infinitesimal generator 
for the k point motion can be explicitly written in terms of the generator 
L, say, for the one-point motion and a covariance matrix B(x,y) for the 
two-point motion. In particular for / £ C 2 (K d ) with compact support, 



and 



r( 1J ). 5 t ra-'- j ;w»»-")i 

The operator L and the matrix function B are related by the fact that 
B(x,x) is the symbol of the operator L. Together, L and B are called the 
local characteristics of the flow; see Le Jan and Watanabe [27]. For the 
stochastic flow constructed above as the limit of random stirring processes, 
the operator L has constant coefficients and B(x,y) depends only x — y. 
This implies that the law of the stochastic flow is homogeneous in space 
as well as time. For the example on M. 2 considered by Harris in [18], the 
rotational invariance of the vector field V implies that law of the stochastic 
flow is invariant under rigid motions of M. 2 , and in particular the one-point 
motion is Brownian motion (up to a scaling factor). 

3.2. Isotropic stochastic flows. A stochastic flow {X s t : < s < t < oo} is 
isotropic if its law is invariant under rigid motions of W 1 . Harris [18] studied 
incompressible isotropic stochastic flows on 1R 2 , and Baxendale and Harris 
[5] and Le Jan [23] studied the general d-dimensional case. 

For an isotropic stochastic flow, the generator L for the one-point mo- 
tion is a multiple of the Laplace operator A, and the law of the flow 
is determined by the covariance matrix B. Invariance under translations 
implies B(x,y) = B(x — y,0), and then invariance under rotations implies 
B{x) = B(x,0) satisfies B(x) = G*B(Gx)G for all real orthogonal matri- 
ces G. This condition gives a representation of B using Bessel functions; 
see Yaglom [44] and Ito [20]. A corresponding representation for isotropic 
stochastic flows on a sphere S rf appears in Raimond [41]. 
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The isotropy condition implies that certain geometric properties of the 
flow can be calculated explicitly. For example, the length \\vt\\ of a tan- 
gent vector vt = DXot(x)(v) is a geometric Brownian motion and the top 
Lyapunov exponent Ai = limt-Kx, t log can be calculated explicitly in 
terms of B. Other local geometric properties such as the curvature of a sub- 
manifold of M d have been calculated; see Le Jan [24] and Cranston and Le 
Jan [8]. 

Of more interest are results involving the joint behavior of infinitely many 
points. A result of Baxendale and Harris on the length of a small curve 
in the case Ai < has recently been sharpened by Dimitroff [14]. Results 
of Cranston, Scheutzow and Steinsaltz [10, 11] show that, while ||Xot(a;)|| 
grows like \ft for each fixed x, if D is a nonsingleton connected set in R d for 
d > 2 and the isotropic stochastic flow has Ai > then sup{||Xot(y)|| : y G D} 
grows almost surely linearly as t — > oo. 

For any measure (distribution of mass) v, let vt denote the induced ran- 
dom measure v o X^ 1 . Zirbel [45] contains estimates on the first two mo- 
ments of Uf. Recently Cranston and Le Jan [9] and Dimitroff and Scheut- 
zow [15] have proved asymptotic normality of the rescaled random measure 

3.3. Coalescing flows. For vector fields Vo, Vi, V2, ... on M d and indepen- 
dent scalar Brownian motions {W{ : t > 0}, {Wf : t > 0}, . . . , consider the 
stochastic differential equation 

(3.1) dx t = V (x t )dt + Y / V a (xt)dW t a . 

a>l 

Under suitable regularity and growth conditions on the vector fields Vo, V\ , V2, ■ ■ ■ , 
the strong solutions of (3.1) for different initial conditions can be pieced to- 
gether to give a stochastic flow {X s t : < s < t < 00} of homeomorphisms 
M. d ; see, for example, Kunita [22]. The local characteristics of the flow are 
the operator 

*)=Ew|w^ E £^(*ra*)^^(*) 

p=l p,q=la>l 

and the covariance function 

(3.2) B^{x,y) = Y J V^x)V^y). 

Conversely, any stochastic flow in which L and B have sufficiently smooth 
coefficients arises as the solution of a stochastic differential equation (taking 
the V a to be an orthonormal basis of the reproducing kernel Hilbert space 
of B) and the flow consists of homeomorphisms. 
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Harris [19] introduced the study of coalescing stochastic flows. These are 
ones where the mappings X s t may be many to one. Harris studied the case 
d = 1 with continuous homogenous (in space) covariance function B, and 
obtained conditions for coalescence in terms of the modulus of continuity of 
B at 0. (In contrast, the "Arratia flow" of independent coalescing Brownian 
motions; see [2], has discontinuous B = l{o}-) 

The issue of the existence of nonhomeomorphic stochastic flows in di- 
mensions d>2 was addressed by Darling [12]. More recently, Le Jan and 
Raimond [25, 26] have developed new techniques to interpret the stochastic 
differential equation (3.1) when the covariance function B given by (3.2) is 
non-Lipschitz. In this more general setting, there is not only the possibility 
of coalescence; there is also the possibility that the solution of (3.1) has to be 
interpreted as a flow of probability kernels. The flow of probability kernels, 
rather than a flow of mappings, corresponds to the lack of uniqueness in the 
solutions of (3.1). Examples of such flows include flows on Euclidean space 
M. d and spheres § rf with isotropic, but non-Lipschitz, covariance functions B. 
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